Picture for Bin Zhao

Bin Zhao

Q-GeoMem: Question-Guided Geometric Memory for Video Spatial Reasoning

Add code
May 26, 2026
Viaarxiv icon

VeloGauss: Learning Physically Consistent Gaussian Velocity Fields from Videos

Add code
May 11, 2026
Viaarxiv icon

Learn Weightlessness: Imitate Non-Self-Stabilizing Motions on Humanoid Robot

Add code
Apr 23, 2026
Viaarxiv icon

RPG: Robust Policy Gating for Smooth Multi-Skill Transitions in Humanoid Fighting

Add code
Apr 23, 2026
Viaarxiv icon

ZeroWBC: Learning Natural Visuomotor Humanoid Control Directly from Human Egocentric Video

Add code
Mar 10, 2026
Viaarxiv icon

Mean-Flow based One-Step Vision-Language-Action

Add code
Mar 02, 2026
Viaarxiv icon

Closed-Loop Action Chunks with Dynamic Corrections for Training-Free Diffusion Policy

Add code
Mar 02, 2026
Viaarxiv icon

Understanding Degradation with Vision Language Model

Add code
Feb 04, 2026
Viaarxiv icon

Gene regulatory network inference algorithm based on spectral signed directed graph convolution

Add code
Dec 12, 2025
Viaarxiv icon

Are We Ready for RL in Text-to-3D Generation? A Progressive Investigation

Add code
Dec 11, 2025
Viaarxiv icon